Learning From Sparse Demonstrations
نویسندگان
چکیده
In this article, we develop the method of continuous Pontryagin differentiable programming (Continuous PDP), which enables a robot to learn an objective function from few sparsely demonstrated keyframes. The keyframes, labeled with some time stamps, are desired task-space outputs, is expected follow sequentially. stamps keyframes can be different robot's actual execution. jointly finds and time-warping such that resulting trajectory sequentially follows minimal discrepancy loss. Continuous PDP minimizes loss using projected gradient descent by efficiently solving respect unknown parameters. first evaluated on simulated arm then applied 6-DoF quadrotor for motion planning in unmodeled environments. results show efficiency method, its ability handle misalignment between execution, generalization learning into unseen conditions.
منابع مشابه
Learning from Limited Demonstrations
We propose a Learning from Demonstration (LfD) algorithm which leverages expert data, even if they are very few or inaccurate. We achieve this by using both expert data, as well as reinforcement signals gathered through trial-and-error interactions with the environment. The key idea of our approach, Approximate Policy Iteration with Demonstration (APID), is that expert’s suggestions are used to...
متن کاملRobot Learning from Failed Demonstrations
Robot learning from demonstration (RLfD) seeks to enable lay users to encode desired robot behaviors as autonomous controllers. Current work uses a human’s demonstration of the target task to initialize the robot’s policy, and then improves its performance either through practice (with a known reward function), or additional human interaction. In this article, we focus on the initialization ste...
متن کاملReinforcement Learning from Imperfect Demonstrations
Robust real-world learning should benefit from both demonstrations and interaction with the environment. Current approaches to learning from demonstration and reward perform supervised learning on expert demonstration data and use reinforcement learning to further improve performance based on reward from the environment. These tasks have divergent losses which are difficult to jointly optimize;...
متن کاملLearning Skills from Human Demonstrations
Many robots are designed for use in domestic environments where robots will be engaged in household chores. The robots need to learn ways to do the household chores that humans are now doing. We are taking a learning from demonstration (LfD) approach to this problem [1]. In terms of the household chores, a number of tasks are developed so far; for example, bringing a beer bottle from a refriger...
متن کاملLearning From Demonstrations via Structured Prediction
Demonstrations from a teacher are invaluable to any student trying to learn a given behavior. Used correctly, demonstrations can speed up both human and machine learning by orders of magnitude. An important question, then, is how best to extract the knowledge encoded by the teacher in these demonstrations. In this paper, we present a method of learning from demonstrations that leverages some of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Robotics
سال: 2023
ISSN: ['1552-3098', '1941-0468', '1546-1904']
DOI: https://doi.org/10.1109/tro.2022.3191592